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ABSTRACT 

I revisit the question of the adiabaticity of initial conditions for cosmological pertur- 
bations in view of the 3-year WMAP data. I focus on the simplest alternative to 
purely adiabatic conditions, namely a superposition of the adiabatic mode and one 
of the 3 possible isocurvature modes, with the same spectral index as the adiabatic 
component. 

I discuss findings in terms of posterior bounds on the isocurvature fraction and 
Bayesian model selection. The Bayes factor (models likelihood ratio) and the effec- 
tive Bayesian complexity are computed for several prior ranges for the isocurvature 
content. I find that the CDM isocurvature fraction is now constrained to be less than 
about 10%, while the fraction in either the neutrino entropy or velocity mode is be- 
low about 20%. Model comparison strongly disfavours mixed models that allow for 
isocurvature fractions larger than unity, while current data do not allow to distinguish 
between a purely adiabatic model and models with a moderate (i.e., below about 10%) 
isocurvature contribution. 

The conclusion is that purely adiabatic conditions are strongly favoured from a 
model selection perspective. This is expected to apply in even stronger terms to more 
complicated superpositions of isocurvature contributions. 

Key words: Cosmology - Initial conditions - Model selection - Bayesian methods 



1 INTRODUCTION 

The detailed nature of the initial conditions for cosmological 
perturbations is one of the open questions in cosmology. The 
exquisite precision of the WMAP measurement of the first 
acoustic peak location in the cosmic microwave background 
(CMB) temperature power spectrum (£ = 220.7 ± 0.7, see 
Iffinshaw etafl i|2006l )) is a strong indication in favour of 
adiabatic initial conditions, which predict for the first peak 
£ « 220. The alternative possibility of cold dark matter 
(CDM) isocurvature initial conditions excites a sine wave 
(rather than the cosine excited by adiabatic conditions) in 
the photon-baryon plasma, resulting in a first acoust i c pea k 
displac e d by h alf a period to £ » 330, see e.g. iTrottal |2004) : 
iDurrerl ||2004T ). Furthermore, the ratio of the Sachs- Wolfe 
plateau for I <J 50 to the height of the peak is very different 
for the two modes. 

A few years ago, Bucher and collaborators introduced 
two new isocurvature modes, called "neutrino density" (or, 
more appropri ately, "neutrino ent ropy") and "neutrino ve- 
locity" modes (Buc her et al.l l2000). They are characterized 
by a non-vanishing initial entropy perturbation in the neu- 
trino sector and by a non-vanishing difference in the neu- 
trino to photon velocity, respectively. A superposition of 
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the adiabatic and the three isocurvature modes (cold dark 
matter, neutrino entropy and neutrino velocity) constitutes 
the most general initial conditions for the perturbations, 
at leas t if the Univ erse is radiation dominated in its early 
phase (Trotta 2004). A baryon isocurvature mode is obser- 
vationally indistinguishable from a CDM isocurvature one 
jBucher et al.ll200ll ; [Gordon fc Lewisll2003l ) and can thus be 
neglected without loss of generality. 

Allowing for the most general type of initial condi- 
tions has two effects on cosmological parameter extraction 
from CMB measurements. First, the extra parameters as- 
sociated with the initial conditions introduce severe de- 
generacie s which limit our ability to reconstruct the cos- 
mology (|Trotta et all l200ll . 120031 ). even though this can 
fortunately be remedied by using polariza tion information 
(|Bucher et alJl200ltlTr"otta ' fe Durrerll2004( h Secondly, it be- 
comes difficult to constrain the type of initial conditions, i.e. 
the amount of isocurvature contrib utions allowed on top of 
the predominantly adiabatic mode l|Moodlev et al.l l2004). 

Recent works have investigated general isocurvature 
admixtures in the initial conditions (IBeltran et al ] 120051 ; 
iMoodlev etaH 120041 ; iBean et al.ll2006l ). In this work I fo- 
cus on the simplest alternative to a purely adiabatic power 
spectrum, namely a superposition with one totally (anti- 
) correlated isocurvature mode at the time with the same 
spectral index as the adiabatic one. This is partly motivated 
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by models for the generation of initia l cond it ions such as the 
curvat on (see e.g. Gordon fc Lewis! (120031 ); iLvth fc Wandsl 
(2002) and references therein), where this kind of scenario 
arises as a generic prediction. A second justification comes 
from the model selection approach used in the second part of 
this work. In comparing the simplest (i.e., purely adiabatic) 
scenario with a more complex one, it makes sense to start 
by adding a minimal number of extra parameters, and see 
whether the extended model is justified by the data. This 
model selection pers p ective has bee n recently advocated by 
iBeltran et ail (|2005l ); iTrottal |2005l ). 

This paper is organized as follows: in section [2] we in- 
troduce the parameterization of the initial condition param- 
eters space we are considering, while in section [3] we review 
some concepts of Bayesian statistics and in particular the 
model selection approach. We present our results in terms 
of parameters constraint and model comparison outcome in 
section [4] and offer our conclusions in [5] 



2 THE ISOCURVATURE FRACTION 

The most general initial conditions for cosmological per- 
turbations are described by a symmetric 4x4 matrix, M, 
with 10 free parameters representing the amplitudes of the 
pure modes (along the diagonal) and their correlations (off- 
diagonal elements) . From a phenomenological point of view, 
there are also 10 more parameters describing the spectral 
tilt of each mode and correlator. If one is willing to consider 
running of the spectral index, then this would introduce an- 
other 10 free parameters in the problem. As motivated in 
the introduction, we consider here a minimal extension of 
the simplest adiabatic model, namely a diagonal matrix 



M = diag(C,5 C) ^ ) K) 
= C-diag(l,/ ci ,/ nc ,/ nv ), 



(1) 

(2) 



where ( is the amplitude of the curvature perturbation (adia- 
batic mode), S c , S y are the (gauge invariant) entropy pertur- 
bations in the CDM and neutrino component defining non- 
vanishing CDM isocurvature and neutrino entropy modes, 
respectively. The neutrino entropy mode is often referred 
to as "neutrino density". The quantity V v corresponds to 
a non-zero neutrino-photon velocity givin g rise to a neu- 
trino velocity mode (see e.g. I Trotta! (|2004l ) for precise def- 
initions). The quantities /ci,/ nc ,/nv give the isocurvature 
fractions with respect to the curvature perturbation, where 
the notation employed is ci = CDM isocurvature, ne = neu- 
trino entropy and nv = neutrino velocity. The sign of f x 
(with x — ci, ne, nv) determines the nature of the correla- 
tion: a positive correlation (f x > 0) results in extra power 
to the Sachs-Wolfe plateau, a negative correlation (f x < 0) 
subtracts power in the region £ <, 50. As already mentioned, 
we take a common spectral index for the adiabatic and the 
isocurvature mode, n B , and we analyse separately the three 
scenarios where only one of the isocurvature modes is non- 
zero, in addition to the adiabatic mode. 

An alternative parameterization for the isocurvature 
contribution that is common in the literatu re is given in 
terms of t he variable a x , or a x (used e.g. bv lBeltran et al.l 
l|2004h and lBean etHI l|2006t )). This is related to f x by 



From a phenomenological perspective, there is little reason 
to prefer one parameterization over the other. However, from 
a model selection point of view the choice of the variable one 
puts flat priors on is of great importance, since the avail- 
able parameter space under the prior enters in the calcu- 
lation of the Occam 's factor for the model, see the discus- 
sion m ITrottal (|2005l ). We must consider the choice of priors 
as inherent to the specification of the extended model and 
different choices will lead to different conclusions since the 
Occam's razor effect is not invariant under non-linear trans- 
formations of variables. 

Once a fundamental model for the generation of the 
initial condition is specified, one can select the appropriate 
physical variable over which to impose a prior reflecting our 
state of knowledge before we see the data. For instance, it 
can be argued that the f x parameterization is closer to the 
curvaton setup, while the a x choice of variable compresses 
the parameter space in the compact interval —1 < a x < 1. 
A flat prior of a x gives equal a priori accessible volume 
to adiabatic-dominated {\a x \ ^ 0.5) and to isocurvature- 
dominated (\a x \ > 0.5) models. The prior on f x is very 
much dependent on what we think the available parameter 
space is under our extended model. Therefore we discuss 
below the results of model selection as a function of the prior 
width A/, taking a flat prior in the range — A/ ^ f x ^ A/. 
This allows an easy comparison once a prior range under 
a specific model is given. We postpone to a future work a 
detailed analysis of prior selection based on first principles. 



3 PARAMETER ESTIMATION, MODEL 

SELECTION AND MODEL COMPLEXITY 

Bounds on the isocurvature fraction are derived in terms of 
high probability regions in the posterior probability density 
function (pdf) for the parameters 9 given the data d, p(0\d). 
This is obtained through Bayes theorem, 



p(9\d) = 



P(d\e)7v(9) 
p(d) ' 



(4) 



where p(d\8) is the likelihood function, ir(8) is the prior pdf 
and p(d) is the model likelihood (sometimes called "the evi- 
dence" ) of the data under the model. The model under con- 
sideration is defined by the parameter set 9 and the choice 
of the prior n(9) (we shall return to this point below). 

The model likelihood is a normalization constant inde- 
pendent on the parameters of the model, and it can be ig- 
nored as far as the parameter estimation step is concerned. It 
becomes the key quantity for model selection, and in partic- 
ular we are interested in the relative odds between the sim- 
plest, purely adiabatic model Mq and a model augmented 
by an extra isocurvature contribution, I x = (Mo,f x ), with 
x — ci, ne, nv as above. The change in our degree of belief 
in the two models after we have seen the data is described 
by the Bayes factor 



B 



P {d\M Q ) 
p(d\h) ' 



(•>) 



sign(/^ 



fx 



1 + 



(3) 



which is the ratio of the normalization constants for the 
two models in Bayes theorem, Eq. Q. Since the two mod- 
els are nested (i.e., we obtain Mo from I x by setting the 
isocurvature fraction to zero, f x =0), the Bayes factor can 
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be conveniently co mputed using the Savage-Dickey density 
ratio (SDDR) (see iTrottal (|2005l 'l and references therein) 

This is easy to compute from a Monte Carlo Markov Chain 
(MCMC) , requiring only knowledge of the properly normal- 
ized posterior over the extra variable f x of the extended 
model. Furthermore, using the SDDR has the advantage 
that the impact of a change of prior can usually be eval- 
uated by simply post-processing a chain including the new 
prior. This is the approach used below in section U If the 
posterior pdf is well approximated by a Gaussian distribu- 
tion with mean /i and standard deviation a, and for a flat 
prior in the range —A/ ^ f x ^ A/, the Bayes factor (JSJ 
becomes 

b = JI*L Uc (_ MzAA _ orfc (m*l)] " . 

V 7T g [ V \/2cr ) V y/2a J 

(7) 

For lnB > model Mq is favoured over I x because the 
extra complexity (in terms of wasted parameter space) of 
I x is not warranted by the data, while for In B < I x is 
favoured since the data require the extra parameter. A use- 
ful rule of thumb ijKass fc Raftervlll995h is that a positive 
(strong) preference requires | In B\ ^ 1 ( <^ 3). A model like- 
lihood ratio | lnBj > 5 (corresponding to odds > 150 : 1 is 
deemed to constitute "decisive" evidence.) 

Finally, the last relevant quantity for our analysis is the 
Bayesian model complexity, which measures the number of 
parameters the data can support, regardless of whether the 
parameters in question a re act ually detected or not (for more 
details, see lKunz et al.l (|2006)). The Bayesian complexity is 
defined as 

C 6 = PW-X 2 W, (8) 

where the effective X 2 (#) i s derived from the likelihood as 
X 2 (9) — —2\ap(d\0). The bar denotes an average over the 
posterior pdf, while the hat denotes a point -estimator which 
in our case we take to be the mean under the posterior, 
i.e. 8 = 8. We will use Cb to quantify the number of sup- 
ported parameters in our extended models I x , in order to 
verify whether the isocurvature fraction is a variable that 
could have been detected using current data. A detailed dis- 
cussion of the meaning an d interpretat i on of the Bayesian 
complexity can be found in iKunz et all (|2006h . 

It is important to stress that both the model likelihood 
and the Bayesian complexity depend not only on the data 
but also on the model description one choose s to adopt, i.e . 
on the prior choices one makes for 7r (/„) fsee lTrottal jjooj ) 
for an example applied to the case of dark energy models). 
This is an irreducible feature of the Bayesian model selection 
approach. It seems to us that there cannot be an absolute 
notion of "a best model" , but only relative statements about 
the support the data give to different models when compared 
to each other. Furthermore, the very concept of Bayesian 
complexity is only meaningful when the constraining power 
of the data is compared to the scale of the problem at hand, 
which again must be defined by specifying the prior. 

Our simplest model Mo is a flat ACDM Universe with 
purely adiabatic conditions, described by following set of 6 



parameters 

8 = (C,Ub,Vc, @*,n 3 ,T r ) (9) 

where £ is the curvature perturbation, uib,uJ c are the phys- 
ical densities of baryons and CDM, respectively, O* is the 
ratio of the sound horizon to the angular diameter distance 
to last scattering, n s is the spectral tilt and T r the opti- 
cal depth to reionization. An extra bias parameter b for the 
matter power spectrum is treated as a nuisance parameter 
and marginalized over, hence we do not count it as an ad- 
ditional parameter. We do not consider tensor modes nor 
extra neutrino species nor running of the spectral index. We 
take our 3 neutrino families to be massless and we fix the 
dark energy equation of state to w = — 1 at all redshifts. 
All of those choices are motivated by the fact that inclusion 
of any of the above extra parameters is presently not re- 
quired by the data. This means that a comparison between 
a model including both the isocurvature fraction and one 
of the above extra parameters against the simple adiabatic 
model would favour even more strongly the latter, as a con- 
sequence of the extra Occam's factor effect coming from the 
extra parameter. In this sense, our model selection is actu- 
ally conservative. 

The situation is different for parameter constraints, 
since in this case strong degeneracies between the isocurva- 
ture fraction and other extra parameters might change the 
posterior bounds of f x . In particular, one can expect a strong 
degeneracy between the CDM isocurvature mode and the 
presence of a tensor mode from gravity waves, when consid- 
ering temperature power spectrum information alone. The 
extra power contributed by the CDM isocurvature to the 
Sachs-Wolfe plateau for small I's is strongly anti-correlated 
with the tensor mode amplitude. However, inclusion of po- 
larization data would help in breaking this degeneracy, at 
least partially. The impact of allowing a tensor mode contri- 
bution is very mild for the neutrino modes, since the Sachs- 
Wolfe plateau is lower than the first acoustic peak for these 
modes, and as a consequence constraints on their amplitudes 
are dominated by the height of the peak, not by the height of 
the plateau. Similar considerations also apply for a possible 
running of the spectral index. Other parameters that mainly 
affect the angular diameter distance to last scattering and 
therefore the position of the acoustic peaks in the spectrum 
(such as the dark energy equation of state, the curvature of 
spatial sections or an extra background of relativistic parti- 
cles) present only weak degeneracies with the isocurvature 
fractions, since the peaks' position is strongly constrained 
by the data. 

In the following, we therefore limit our analysis to the 
6 parameters model Mo described above, complemented in 
the extended models by the isocurvature fractions as follows. 
The extended models I x contain a non-vanishing isocurva- 
ture fraction 

Ix = (0, fx) (10) 

where f x is defined in Eq. ([2| and x = ci, ne, nd. The spectral 
index of the isocurvature mode is the same as the adiabatic 
one, n s . The correlation coefficient between the adiabatic 
and the isocurvature mode is ±1, depending on the sign of 
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Table 1. Constraints on the isocurvature fraction f x , from 
WMAP3 and other CMB data, and SDSS matter power spec- 
trum measurements. We give the posterior mean fi and standard 
deviation <r, as well 1— dimensional marginalized regions encom- 
passing 95% of posterior probability. 
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Isocurvature fraction f x 

Figure 1. Normalized posterior probability density for the isocur- 
vature fraction parameter f x . CMB and large scale structure data 
are compatible with purely adiabatic initial conditions, with a 
slight tendency towards negatively correlated isocurvature com- 
ponents. 



4 RESULTS AND DISCUSSION 

In this section we present our results about the isocurva- 
ture fraction in terms of posterior bounds, Bayesian model 
selection and effective model complexity. 

We us e the WMAP 3-year temperature and polariza- 
tion data (jHinshaw et alj l200fj ; iPage et all 120061) supple- 
mented by small-scale CMB measurements ( Readhead et al.l 
12004 iKuo et al.ll2004 ). We add the Hubble Space Telescope 
measurement of the Hu bble constant Hq — 72±8 km/s/Mpc 
l|Freedman et alj l200ll ) and the Sloan Digital Sky Survey 
(SPSS) data on the m atter power spectrum on linear scales 
jTegmark et alj|2004h . 

In Figure Q] we plot the 1-dimensional, marginalized 
posterior pdf on the isocurvature fraction parameter f x . We 
we have adopted a flat prior of f x of width much larger 
than the posterior, so that the range of the prior does not 
influence the result. The isocurvature fraction is compatible 
with zero for all three isocurvature modes, with a slight shift 
of the peak of the pdf to negative values. This corresponds 
to a negative correlation, in which case the contribution to 
the large scales CMB power due to the isocurvature auto- 
correlation spectrum is largely compensated by the negative 
correlator. The posterior mean and standard deviation for 
f x are given in Table Q] as well as 1-dimensional marginal- 
ized intervals encompassing 95% of probability. We find that 
the isocurvature fraction for the CDM mode is constrained 
to be -0.10 < fd < 0.06 (95% probability), while for the 
two neutrino modes we obtain —0.20 < / nc < 0.12 (neu- 
trino entropy) and —0.18 < / nv < 0.22 (neutrino velocity). 
We notice that the tightest constrained mode is the CDM 
isocurvature one. This is because with our definition of f x , 
for a given value of f x the CDM isocurvature is the mode 
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with the largest contribution to the CMB power spectrum. 
Also, all of the 1-dimensional pdf's for f x are very close to 
Gaussian. Hence we expect that Eq. J7J is a good approxi- 
mation to the Bayes factor, Eq. (J6j , as we now show. 

We now evaluate the Bayes factor between the mod- 
els including an isocurvature contribution and the simplest, 
purely adiabatic model. As we have seen above in the pa- 
rameter extraction step, there is no indication that the data 
require and isocurvature component, since the isocurvature 
fraction is compatible with . This is consistent with the 
findings of Bean et al. (2006). We therefore expect the Bayes 
factor to favour the purely adiabatic model on the ground 
of the Occam's razor argument. The strength of evidence 
in favour of the adiabatic model depend on the amount of 
wasted parameter space for the isocurvature fraction, i.e. by 
the prior range A/. In the top panel of Figure[2j we plot the 
Bayes factor as a function of the prior width A/, while in 
the bottom panel we plot the Bayesian complexity, i.e. the 
number of parameters effectively constrained by the data. 
We can see that for models with poor predictivity, i.e. a large 
prior accessible range A/ 3> 1, one finds strong (> 20 : 1) 
to decisive (> 150 : 1) posterior odds against the extended 
model for all of the three isocurvature modes. We also plot 
the Gaussian approximation to the SDDR for the Bayes fac- 
tor, Eq (0, for the CDM isocurvature mode, and find a very 
good match with the value computed numerically from the 
Monte Carlo chain. 

For a prior choice A/ ^ 1, the Bayesian complexity is 
close to 7, indicating that all of the 7 parameters of the ex- 
tended model have been measured. We therefore conclude 
that models predicting up to the same amount of isocurva- 
ture to adiabatic power (the case A/ = 1) are strongly dis- 
favoured for the CDM mode, and mildly disfavoured in the 
case of the two neutrino modes. However, if the prior range 
is reduced below A/ = 1, i.e. for models predicting predomi- 
nantly adiabatic initial conditions with subdominant isocur- 
vature contribution, the Bayes factor gives an inconclusive 
result, with about equal odds for the purely adiabatic and 
the mixed models. At the same time, the Bayesian complex- 
ity decreases, indicating that f x is only poorly constrained 
with respect to the scale of the prior, especially for the neu- 
trino density and velocity modes. This reinforces the con- 
clusion that current data are not strong enough to select 
among a purely adiabatic model and one which predicts up 
to 10% isocurvature contribution and we need to acquire 
better data in order to obtain a higher-odds result. 



5 CONCLUSIONS 

We have submitted the question of the type of initial con- 
ditions for cosmological perturbations to renewed scrutiny 
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Figure 2. Result of model selection between a purely adiabatic 
model and an extended model featuring a totally (anti-) correlated 
isocurvature component, as a function of the prior available range 
for the isocurvature fraction, Af. Top panel: the Bayes factor 
strongly disfavours models with Af 3> 1 because of the Occam's 
razor effect, while models predicting an isocurvature fraction be- 
low about 10% in any of the three modes cannot presently be ruled 
out (InB < 1). The dotted line gives the Gaussian approxima- 
tion to the Bayes factor, Eq. 10, for the CDM isocurvature mode. 
Bottom panel: the Baycsian complexity gives the effective number 
of parameters the data can support. For Af <J 1 the isocurvature 
component in the neutrino entropy and velocity modes is not 
supported by the data. The errors have been computed as the 
variance of 10 random sub— chains, and the neutrino entropy and 
velocity modes have been shifted horizontally to the left and to 
the right, respectively, for clarity. 



in the light of WMAP 3-year data. We have focused on the 
simplest and well motivated alternative to a purely adiabatic 
model, namely an admixture of one totally (anti-)correlated 
isocurvature mode at the time, with the same spectral tilt 
as the adiabatic one. 

We have derived posterior bounds on the isocurvature 
fraction from WMAP 3-year data combined with other 
CMB measurements and SDSS. We have constrained the 
isocurvature fraction in the CDM mode to be less than about 
10%, while the maximum allowed neutrino isocurvature con- 
tribution (either density or velocity) is about 20%. 

Bayesian model selection tends to favour purely adia- 
batic initial conditions with strong odds (> 20 : 1) when 
compared to models predicting isocurvature fractions larger 
than unity. For such models - having a large prior range 
on the isocurvature fraction - we have shown that the data 
can support 7 parameters, but that only 6 of them are re- 
quired, with no need to include isocurvature modes from a 
model selectio n point of v i ew. T hese findings confirm the 
conclusions of iKunz et alj (|2006t ). However, mixed models 
that limit the isocurvature contribution to less than about 
10% cannot presently be ruled out. We have shown that the 
constraining power of the data for this class of models is 



insufficient, and therefore we must hold our judgement until 
better data becomes available. These findings are however 
dependent on the parameterization chosen for the isocurva- 
ture fraction, that in this work is motivated by the curvaton 
scenario. The question of prior selection will be further ad- 
dressed in a future publication. 

It is reasonable to expect that the same conclusion 
would apply in even stronger terms to the case of more 
complicated models, e.g. those involving a superposition of 
different isocurvature modes at the same time, or with ar- 
bitrary correlations among them. In fac t, more com p licate d 
models (such as the class considered bv iBean et alJ (|2006h ) 
ought to be even more disfavoured because of their larger 
volume of wasted parameter space. At present, Occam's 
razor is perfectly compatible with the simplest possibility, 
namely purely adiabatic initial conditions. 

We think that this model comparison approach can be 
a useful complement to parameter constraints analysis, and 
that it can offer valuable guidance in building models for the 
generation of primordial perturbations. 
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